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Loss of the capability to talk or hear applies psychological and social effects 
on the affected individuals due to the absence of appropriate interaction. 
Sign Language is used by such individuals to assist them in communicating 
with each other. The paper aims to report details of various aspects of 
wearable healthcare technologies designed in recent years based on the aim 


of the study, the types of technologies being used, accuracy of the system 
designed, data collection and storage methods, technology used to 
Keywords: accomplish the task, limitations and future research suggested for the study. 
The aim of the study is to compare the differences between the papers. There 
Glove : ; ; : : 
-_ is also comparison of technology used to determine which wearable device 
Pattern recognition gesture is better, which is also done with the help of accuracy. The limitations and 
recognition future research help in determining how the wearable devices can be 
Sign language sensor improved. A systematic review was performed based on a search of the 
literature. A total of 23 articles were retrieved. The articles are study and 
design of various wearable devices, mainly the glove-based device, to help 
you learn the sign language. 
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1. INTRODUCTION 

Living in the modern era of computerized world, where everything is straightforward and basic, a 
chunk of this world is losing the advantages this age has to offer [1]. The capacity to talk or the potency of 
speech is something we always underestimate. It is one of the best and incredible ways of sharing thoughts 
and feelings. It facilitates the communication with other individuals. Deafness is characterized as a level of 
hearing loss with the end goal that an individual cannot comprehend speech even when the sound is loud. As 
indicated by the World Health Organization (WHO), around 466 million individuals have hearing loss, 34 
million of those are kids. The WHO additionally assessed that more than 900 million individuals will have 
this disability by 2050. There are a few reasons for hearing loss: hereditary problems, certain contagious 
illnesses, complications at birth, chronic ear infections, the utilization of specific drugs, exposure to extreme 
noise and ageing. This communication boundary unfavorably influences the lives and social connections of 
deaf individuals [2]. 

Human gestures are an effective and amazing method of communication. These are used to express 
a persons’ emotions. A sign language is a language that utilizes manual gestures to pass on important 
information, instead of using speech. Nonetheless, there are interaction hindrances between hearing 
individuals and deaf people because deaf people will be unable to talk and hear or that hearing people will be 
unable to use sign to express themselves. This communication disparity can have an adverse effect on the 
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lives of the deaf people. Two customary methods of communication between deaf people and hearing people 
who do not comprehend gesture-based communication: through mediators or text. 

Innovation has diminished this mismatch through procedures that converts gesture-based 
communication into speech. These frameworks can be extensively ordered in the sorts of system, that can be 
used to change over sign language into speech. These are glove-based framework and vision-based 
framework. In glove-based frameworks, an individual sign's while conversing is transferred to the personal 
computer (PC). The real time sign is matched with the data set that contains all the signs, that were added at 
first to the framework. After comparing with the right sign, the information is passed onto text to sound 
converter, where the information is changed over to sound from sign. While the vision-based framework uses 
camera to recognize the gestures made by the hand and the body. However, vision-based framework requires 
lot of processing on the image such as color segmentation, image filtering and boundary detection. 

This study tries to review the sign language gloves by doing a comprehensive assessment and 
evaluation based on a comprehensive synthesis of sensor glove papers. We present this thorough review that 
highlights the important achievements, sensor positioning, accuracy, limitations, methodology used, time 
taken to process and stress the challenges and prospects for this developing area of research. 


2. METHOD 

This study offers a methodical analysis of literature centered on the smart gloves for mute 
individuals. Several evolving ideas have been considered to enhance an insight of advances in the 
information on this crucial issue. This study tries to connect this disparity in the literature by doing a 
comprehensive assessment and evaluation based on a comprehensive synthesis of sensor glove associated 
research published. The following are a synopsis of the key opinions evolving from this literature assessment, 
— Sensors used 
— Sensor positioning 
— Sign language 
— Accuracy and efficiency of the system 
— Time taken to process 
— Limitations 
— Methodology used 

This paper categorized relevant articles by using an approach called keyword search. Various 
keywords were found and explored on JEEE explore. The main aim of this research is to understand the 
current research level on sensor gloves that can be used to interpret sign language. After rummaging through 
IEEE explore, 23 editorials were found that met the benchmarks for this analysis. All the articles were 
comprehensively reviewed by the authors to discover familiar factors. These factors were associated to find 
the differences in each article. The key aim of the article was also uncovered when processing these articles. 

The results section is divided into 3 sections that are aim of articles, sensors, and feature comparison. 
In aim of articles, the major goal of the papers is discussed. While in sensors, comparison is done between the 
sensors used in these gloves and what impact that has on the result. And finally, in feature comparison, analysis 
is done to differentiate the differences in the papers that are achieving the same outcome. 


3. RESULTS AND DISCUSSION 

Several articles that have been reviewed by the authors have a similar main goal that is to design the 
glove that can translate the sign language into text. The majority of articles use flex and accelerometer to 
identify gestures. However, some of them use different types of gloves like touch sensor glove and surface 
electromyography (SEMG). These gestures are then either converted to text or to speech. To convert the 
gestures into meaningful data the articles have used various approaches like machine learning, database and 
different algorithms. 

A good number of articles used microcontroller to process the sensor data, by dividing the sensor 
values into ranges for each gesture [3] and [4]. However, this methodology could cause error when the 
sensors are not in range [1], [5], [6] and [7]. A machine learning algorithm was used to determine the gesture 
in some of the articles [2] and [8]. While some use lookup table to determine the gestures [9]-[11], and [12], 
the article [13] and [14] used a data segmentation method. Data segmentation is based on a threshold-based 
method to extract 21 features for each data segment. As the classification model, the authors used the 
multivariate Gaussian distribution with diagonal covariance matrix. The multi-objective Bayesian framework 
for feature selection (BFFS) is implemented, with two objectives being discriminability power and fault 
tolerance maximization, to improve the recognition accuracy and reduce the model complexity by selecting a 
set of the 21 features. Natesh et al. [10] designed a glove for eight different sign languages. They were 
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Australian sign language (AUSLAN), British sign language (BSL), New Zealand sign language (NZSL), 
Indian sign language (ISL), deaf blind sign language (DBSL), Czech sign language (CSL), British, Australian 
and New Zealand sign language (BANZSL) and standard manual sign language (SMSL). 

The authors in [15] and [16] used sEMG to get the data, which was processed for segmentation. 
After segmentation, features were extracted. This was done for the inertial sensors and static sensors. Then 
both these features were cascaded to select the best subset of the feature. After determining the subset, it was 
passed through classification model to recognize the gesture. 

In [17] and [18] features were extracted from the sensors. These features were passed through a 
support vector machines (SVM) based classifier to recognize the gesture made. In [19] and [20] gesture 
recognition algorithm is devised which checks for contact sensor if they provide a similar gesture. if the 
gesture is not found, the code looks for any resemblance regarding the flex sensors. These two stages can 
determine almost all the gesture except the dynamic ones. To determine the dynamic gestures the inertial 
sensor is used. In [21] and [22] no dynamic gestures were recognized as the glove could only recognize static 
gestures. To process the gesture, three layers of nodes were used. The first layer hard 7 nodes which were the 
values from the seven sensors. These 7 nodes were converted into 52 nodes by applying weight on them. The 
final layer had 26 nodes; each node denotes to an alphabet. The authors in [23]—[31] and [32] stored the 
sensor values into a file which was loaded into a LabView program. This program receives the values from 
the file and matches this data with gestures close to American sign language (ASL) gestures. In article [33]— 
[36] and [37], the gestures for each letter were processed by storing the voltage levels of the sensors into the 
database. To determine a gesture, the program compares the sensor values to the ones in the database. 

In [38] and [39], a learning mode is used to train the system. the system stores values from the gloves 
in a comma-separated values (CSV) file 20 times. These readings are used to evaluate the range of each gesture 
for this specific user. Whenever the user starts communicating using the glove, the program uses the sensor 
values and look for the value of the gesture made by the user in the CSV file. The authors in [40] and [41] used 
an SVM base classifier. The sensor values for each gesture are used to train the SVM classifier. The model 
generated is used to predict the gesture made. In [42], [43] and [44], online gesture recognition model is 
proposed which consists of two parts: segmentation with threshold-based and classification with a probabilistic 
model. The system determines the sensor values and uses the threshold to determine which gesture it is. To 
validate the gesture probabilistic model is used, a very inefficient method was used in [35], [45] and [46], where 
the gestures were recorded manually. The user has to make the gesture and select the alphabet. This solution 
will not detect the gesture if there is a slight difference in the sensor readings. In [47]-[49] and [50] the sensor 
values were obtained and passed through multiple algorithms to evaluate the accuracy. The glove was able to 
achieve more accuracy than previous experiments. Figure 1 shows the overall design for sign language 
recognition for most systems. Figure 1 shows the overall design for sign language recognition for most systems. 
The articles presented in this review have been summarized in Table 1 (APPENDIX). 


| 
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Figure 1. Block diagram of common sign language recognition system 


4. CONCLUSION 

This review was centered on the characteristics of the Glove based wearable devices, it uncovered 
numerous concepts and how they are utilized. The reach and complexity of the articles highlights the 
algorithms and sensors used to develop these gloves. Most of the gloves did not have an accelerometer to get 
the dynamic gestures. They have used different methodology. However, this methodology of using flex to 
identify dynamic gestures could produce ambiguous results. Some of the gloves could not process data in 
real-time, which could be a real drawback as the application is to talk in real-time. Some gloves required 
training to be used, which is a problem when the glove is constantly switched between multiple users. 
Various articles did not give adequate information about their test, test plan and accuracy of the system. 
Certain gloves had a delay in reading the gestures, so the hand should stay still for a short period of time 
before moving to the next gesture to allow for the system to interpret it. While some places delay was 
introduced to process the data. This is very inconvenient as the communication speed would be quite slow. In 
terms of accuracy, half of the papers reviewed achieved an accuracy of 90% or more. While six of them had 
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accuracy of 70% to 90%. Some paper had multiple accuracies for different algorithm or design. One of the 
major factors in getting better accuracy was use of either contact sensors or accelerometer to get dynamic 
gestures. The other factor that influenced accuracy was the algorithm used. The algorithms with higher 
accuracy were usually the ones with machine learning algorithm or using different algorithms like k-nearest 
neighbors (k-NN), segmentation and gesture recognition. The ones with the lower accuracies were the one 
which used raw values to determine the gesture. Another factor was the hand size that could have an impact 
on the accuracy. If the size of the hand is big, it would mean that flex sensors would bend properly. Proper 
bending of flex sensors would create a higher range of values which would increase the accuracy. Moreover, 
some of the papers were not able to determine the accuracy or did not mention it in the paper. Future 
improvements can be done on this topic, by designing the glove based on the user to yield better results or 
compare different hand sizes to know the level of inaccuracy. This could be helpful in determining whether 
glove will work with others or not. Use of different sensors could be another improvement, this could help in 


making a more flexible glove. 


APPENDIX 
Table 1. Feature comparison 
Paper Sign Sensor positioning/ Accuracy/ Method used Time taken limitations Future study 
Language Components used Efficiency to process proposed 
the input 

1 ASL 26 Four flex sensors. Not mentioned Todevelopacheap Not Accuracy Making the 
alphabets sensory data glove mentioned limited by glove wireless. 

to help disabled size of the Addition of 

people hand accelerometer. 

communicating Speaker to listen 
to the converted 
gestures. 

2 ASL 26 Two flex sensors are 95% A machine learning Not Not Increase in the 
alphabets used for thumb and algorithm mentioned mentioned size of database. 
and 15 pinky, and a pair of Integrating the 
words. sensors for each glove with other 

other finger devices at home. 

5 ASL 26 The flex sensors and 74% Five flex sensors, 0.74 Soldering Wireless gloves. 
alphabets. the accelerometer an GY61 seconds to defect on 

sewed to a white accelerometer, translate thumb’s flex 
cotton glove Arduino Mega the gesture — sensor 

2560, micro-SD into text 

card reader module, and speech 

liquid crystal 

display (LCD). A 

gesture recognition 

algorithm 

9 American A leather glove with 90% Lookup table, Not Only static Improve 
sign 11 flex sensors, one template matching mentioned gestures. accuracy of the 
language for each finger (5) along with system 
(ASL) and and one for each statistical pattern 
the abduction (4). Two recognition 
Pakistan extra sensors are 
sign used for measuring 
language the roll and pitch of 
(PKL) the wrist. 

13 ASL 26 Five flex sensors 76.1% for the Data segmentation Not Ambiguity Extra contact 
alphabets, connected in parallel Multivariate is based on a mentioned in gestures sensors to 
full stop, with five contact Gaussian threshold-based lea dig to remove 
space, and sensors and an distribution. method. error in ambiguity. 
resting accelerometer. 77.9% for the Multivariate determining 
(fully- There are also seven Multi- Gaussian the letter. 
stretch fabric electrical objective distribution with 
finger). contacts, two Bayesian diagonal covariance 
Total of 29 _ positives, placed on framework for —_ matrix for 
gestures. the index andthumb _ feature classification. 

fingers, and five selection The multi-objective 
negatives, placedon (BFFS) Bayesian 


lower, top, front of 
the middle finger, 
index finger, and 
pinky finger. 


framework for 
feature selection 
(BFFS) is 
implemented to 
reduce complexity. 
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Paper Sign Sensor Accuracy/ Method used Time taken to limitations Future study 
Language positioning/ Efficiency process the proposed 
Components used input 
33 ASL 26 Flex and contact 91.54% It uses the k- 500ms. Only identifies | Updating the 
alphabets. sensor the length nearest Data 26 alphabets. system to 
of the fingers neighbors (k- segmentation identify words 
NN) algorithm was used to and sentences. 
to identify the identify change Reduce the 
alphabets. in gesture. time taken to 
Database used to identify the 
store various alphabets to 
alphabets. less than 
500ms. 
15 ASL SIGN _ Four major 96% The data A quiet period Large number The paper 
80 gestures muscle groups collected from of 2-3 s of gestured suggests that to 
are chosen to three sessions of between may be recognize 
place four the same subject _ gestures is difficult to continuous 
channel sEMG are put together required predict using sentence, a 
electrodes: and a 10-fold the method different 
1) extensor cross validation suggested in segmentation 
digitorum, is done for the this paper. technique or 
2) flexor carpi data collected Wearable other 
radialis longus, from each inertial sensor _ possibility 
3) extensor carpi subject and sEMG- models should 
radialis longus, separately. based sign be considered. 
and language 
4) extensor carpi recognition 
ulnaris. (SLR) system 
The inertial is that the 
measurement unit facial 
(IMU) sensor is expression is 
worn on the not captured. 
wrist, where a 
smart watch 
is usually placed. 
3 ASL 26 It uses 8 92% not mentioned not mentioned 
Alphabets independent Uuses python Recognizes 
and 0-9 capacitive touch : : 
code and RPi alphabets 
numbers sensors. 5 placed aa 
at fingertips and firmware. within 0.7sec. 
3 sensors placed There was a 
between index, countdown of 3 
middle, ring and sec between 
pinky fingers. two gestures. 
10 All eight The glove The two Gesture not mentioned —_ not mentioned 
sign consists of 9 flex The system subsystems recognition 
languages and 8 contact efficiency was interconnected system is 41.1 
AUSLAN, sensors placed in through a pair of — milliseconds in 
BSL, appropriate found to be TIs’ CC2541 IM and 151.5 
BANZL, positions on 80.06% in bluetooth low milliseconds in 
NZSL, ISL, _ fingertips and identification energy (BLE) EIM. It is thus 
DBSL, flex sensors on mode (IM) and modules. capable of 
CSL, the outer region was enhanced recognizing 24 
SMSL of the fingers. to 93.16% in gestures per 
second in IM 
enhanced : and 6 gestures 
identification per second in 
mode (EIM) EIM 
17 ASL 26 Five flex-sensors The proposed 10 sec forevery notmentioned Future work 
Alphabets along the length Accuracy Rate system in this letter with a 3s proposes 
on the outer paper utilizes gap between design of a 


surface of each 
finger. In case of 
the second 
version two 
additional 
pressure sensors 
were placed and 
on the left side of 
the first joint of 
the middle finger, 


of 65.7% can 
be achieved 
without 
pressure 
sensors and 
98.2% 
accuracy with 
pressure 
sensors on the 
middle finger. 


five flex-sensors, 


two pressure 
sensors, and a 
three-axis 
inertial motion 
sensor 


two signs 


smaller sized 
printed circuit 
board, the 
inclusion of 
words and 
sentences at the 
sign language 
level, and 
instantly 
audible voice 
output. 
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Paper Sign Sensor Accuracy/ Method used Time taken limitations Future study 
Language positioning/ Efficiency to process proposed 
Components used the input 

19 American Flex sensors are Gesture Not Not Not mentioned 
sign connected on the —- The system recognition mentioned mentioned 
language fingers (Dorsal algorithm is used. 

(ASL) 26 side of hand). poe First it checks the 
alphabets Contact sensors accuracy of contact sensors 
are connected at 92 % on and see if there is 
multiple places trained ASL any equivalent 
depending onthe testers while gesture. Then it 
gestures. While it should compares with 
the inertial sensor —g 1% on flex sensor 
is connected on reading to refine 
the tip of ring ae the gestures. And 
finger. testers, finally, it uses 
inertial sensor to 
finalize the 
gesture. 

21 American Seven sensor Three-layer Sampling = Does not Use of camera to 
sign gloves are used. 88% algorithm isused, 4timesper have determine sign 
language Five sensors are first layer passes second. dynamic using facial 
(ASL) 24 placed on fingers the raw sensor 0.75s gestures. Or expressions. Use 
alphabets and thumb. One values to the required to gestures with — of speech engine to 
plus two sensor is to second layer. determine two hands. speak the text from 
punctuation measure the tilt Second layer has the letter. the gestures. Extra 
symbols. while the last one 52 nodes. It sensors to 

is to measure the applies weight to determine the body 
rotation the input and language to aid in 
passes to the third sign determination. 
layer. The third 
layer consists of 
26 nodes, each 
node denoting one 
alphabet. 

23 American Eighteen sensors A Labview Cannot Cannot Development of 
sign are used on the 90% program collects process process real- —_ wearable glove 
language glove. Two data and saves it real-time time that recognizes and 
(ASL) 26 resistive bend to a file. This data translate sign 
alphabets sensors on each is analyzed and language to spoken 

finger, four used to train English. And 
abduction sensors neural network. translating spoken 
and sensors While another English to sign 
measuring thumb program uses the language. 
crossover, palm data from glove to 
arch, wrist analyze the ASL 
flexion and sign. After 
abduction. determining the 

sign, the program 

plays the sign. 

34 American It has six flex Every gesturehas Not Not Not mentioned 
sign sensors placedon = 3, different voltage mentioned mentioned 
language fingers, thumb levels, the system 
(ASL) 26 and wrist, and compares voltage 
alphabets three contact level from gesture 

sensors placed on made by the glove 
fore finger, and identify the 
middle finger and alphabet. 

thumb. 

38 American Five flex sensors Static = The data is 1000ms. Mismatches Use of contact 
sign are placed on 95% collected from the in words sensors on the tip 
language fingers and glove and with similar of the finger. 
(ASL) and thumb. Inertial Dynamic = processed using gesture. Using a left-hand 
Arabic sign sensor mpu6050 88% Arduino. And glove. Increasing 
language is also connected. outputted using a the size of the 
(ArSL) graphical user glove. Making the 


interface (GUI) 
program made 
using python3. 


system 
multilingual. 
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Paper Sign 


Language 


Sensor positioning/ 
Components used 


Accuracy/ 
Efficiency 


Method used Time 
taken to 
process 


the input 


limitations Future study 


proposed 


24 American 
sign 
language 
(ASL) 26 
alphabets 


25 American 
sign 
language 
(ASL) 26 
alphabets 


26 Indian sign 
language 


27 American 
sign 
language 
(ASL) 26 
alphabets 


40 American 
sign 
language 
(ASL) and 
Indian sign 
language 
(ISL) 


Five flex sensors are 
placed on fingers and 
thumb. 


Five flex sensors are 
placed on fingers and 
thumb. 


Five flex sensors are 
placed on fingers and 
thumb. 


Five flex sensors are 
placed on fingers and 
thumb. 


Five flex sensors of 
fingers and thumb 
and an MPU6050 


sensor 


Not 
mentioned 


95% 


Limited 
by hand 


size 


Not 
mentioned 


ASL = 
98.91% 
ISL= 
100% 


The sensors give Not 
their raw values to mentioned 
data acquisition 

cards (DAQ) card 

and Lab view 

program. The 

program converts 

the letter to binary 

code. The binary 

code is then used to 
translate letters and 

words. The 

program then 

converts it to text 

and audio. 

The system Not 
determines the mentioned 
alphabets from the 

flex sensors, then 

uses accelerometer 

and contact sensors 

if the gesture is not 

found. 

The sensors give Not 
their raw values to mentioned 
Arduino. The 

Arduino recognizes 

the gesture and 

outputs it to LCD 

screen and send it 

to Bluetooth 

module. A 

smartphone can be 
connected to the 

Bluetooth module 

to get the gesture 
recognized. 

The sensors give Not 
their raw values to mentioned 
Arduino nano. The 
Arduino nano 

transmit the data 

through transmitter 

to Arduino mega. 

The Arduino mega 
converts the signals 

into gesture and 

displays the gesture 

on LCD screen. 

Raspberry pi is 

used to play the 

sound of the sign 
recognized. 

The data from the 2s 
sensors is 

processed by 

feeding it into an 

SVM based 

classifier. After 
determining the 

gesture, the 

Arduino sends the 

data through 

Bluetooth which is 

played on speaker 

after processing it. 


Not Replace flex 

mentioned sensor to increase 
efficiency. Make 
the glove 
wireless. Make 
the program 
standalone to 
make it portable. 


Not Add more 

mentioned sensors like 
gyroscope. 
Enhance speech 
synthesis. 
Wireless 
communication 


Add more 
sensors to 
recognize full 
sign language. 


The accuracy 
of the system 
is limited by 
the hand size. 
Smaller 
hands can be 
more 
accurate due 
to larger bend 
of the sensor. 
Does not 
have 
dynamic 
gestures. 
Does not 
have 
dynamic 
gestures. 


Increase scope by 
adding another 
glove 


Increase number 
of sensors. Add 
additional glove. 


Does not 
have all the 
gestures. 
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Paper Sign Sensor Accuracy/ Method used Time taken _ limitations Future study 
Language positioning/ Efficiency to process proposed 
Components the input 
used 
42 American sign Flex sensors are 73.6% An online gesture 0.5s Less Compare each 
language connected on the recognition model is used features detected word 
(ASL) 26 fingers and which consists of two that disrupt with an 
alphabets plus — thumb. While the parts segmentation and the extensive 
two inertial sensor is classification. accuracy vocabulary set 
punctuation connected on the Segmentation is used to to increase 
symbols. back of the hand. determine the gesture efficiency. 
made and classification is 
done to form the 
sentences. The recognized 
gestures will be sent to 
the speaker after the 
sentence finishes. 
35 American sign _- Flex sensors are 86% The system has two 3s Not Use of 
language placed on the modes, teach, and learn mentioned additional 
(ASL) 26 fingers and mode. In teach mode the sensors and 
alphabets thumb. user can store the gestures camera to aid 
made, while in learn in detection. 
mode the user can check Use of speech 
if the gesture made is engine to play 
correct or not. the translated 
text. 

47 American sign _- Flex sensors are 98% The data from the sensors Not Not Not mentioned 

language placed on the is processed by mentioned mentioned 
(ASL) 26 fingers and comparing it to the 
alphabets thumb. existing voltage level to 

determine the gesture 

made. After recognizing 

the gesture, it’s played on 

the mp3 player module. 
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